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Compound geometric approximation under a failure 

rate constraint 

Fraser Daly* 


Abstract We consider compound geometric approximation for a nonnegative, integer¬ 
valued random variable W. The bound we give is straightforward but relies on having a 
lower bound on the failure rate of W. Applications are presented to M/G/1 queuing sys¬ 
tems, for which we state explicit bounds in approximations for the number of customers in 
the system and the number of customers served during a busy period. Other applications 
are given to birth-death processes and Poisson processes. 
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1 Introduction and main result 


We consider the approximation of a nonnegative, integer-valued random variable VP by a 
compound geometric distribution. We say that Y has a compound geometric distribution 
if it is equal in distribution to where X, Xi,X 2 ,... are i.i.d. and N ~ Geom(p) 

has a geometric distribution with F{N = k) = p{l — p)^ for A: = 0,1, 2,.... In this work 
we will only consider the case where X takes values in M = {1,2,...}. As usual, the 
empty sum is treated as zero, so that PCP = 0) = P(iV = 0) = p. 

Such compound geometric distributions arise in a number of applications in a variety 
of fields, including reliability, queueing theory and risk theory: see [IH for an overview. 
It is well-known that a compound geometric distribution converges to an exponential 
distribution as p —)■ 0. Explicit bounds in exponential approximation for compound geo¬ 
metric distributions have been given by Brown j^, , Bon , and Pekoz and Rollin [1^ . 

Brown’s work takes advantage of reliability properties of such compound geometric dis¬ 
tributions; such properties will also prove useful in our work. The bounds given by Pekoz 
and Rollin 1^ apply more generally than to compound geometric distributions, relaxing 
the assumptions that N have a geometric distribution and that the Xi be independent. 
Pekoz, Rollin and Ross [l^ give bounds in the geometric approximation of compound 
geometric distributions. Note that some of the above-mentioned bounds apply in the case 
where N is supported on {0,1,...}, and some in the case where N has support {1, 2,...}. 
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Here we will consider the approximation of W by our compound geometric random 
variable Y using the total variation distance, dehned by 


dTvmw),C(Y)) = sup |P(1H e H)-P(r e H)| , 

A<ZZ+ 


where Z"*" = {0,1,2,...}. Some work on compound geometric approximation in total 
variation distance has been done by Daly j^, whose main application is to hitting times 
of Markov chains in a quite general setting. We build upon that work by presenting bounds 
which are more straightforward to evaluate, but which require some knowledge about the 
behaviour of the failure rate of the random variable W. We recall that the failure rate 
(or hazard rate) of a nonnnegative, integer-valued random variable W is dehned to be 


rwif) 


= J) 

¥{W > j) ’ 


jez+. 


Some authors use an alternative dehnition, taking the failure rate of W to be 


rwU) 


F(H/ = j) 
F{W > j) ’ 


jez+. 


The failure rate of a continuous random variable may be dehned analogously, by replacing 
the mass function in the numerators of the above with a density function. 

Note that our bounds may be applied in conjunction with bounds for exponential or 
geometric approximation of compound geometric distributions discussed above. 

In our main result. Theorem 11.11 below, we will assume that we have 5 > 0 such 
that rw{j) Y S for all j. Given such a 6, the total variation distance between W and a 
compound geometric distribution may be ehectively bounded by computing their expecta¬ 
tions. This is in contrast to the bounds presented in [^, where more detailed information 
must be known about W to allow them to be computed. 

We note the work by Brown and Kemperman ^ and Brown j^, who hnd bounds on 
the distribution function and variance of a random variable, respectively, under bounds 
on its failure rate. Explicit bounds in probability approximation for a random variable 
with a bounded failure rate have also been derived in the recent work of Brown . He 
gives sharp bounds in exponential approximation for random variables whose failure rate 
may be bounded from above, with applications to compound geometric distributions, 
and hrst passage times of birth-death processes and other reversible Markov chains (in 
continuous time). Note that here we are working with discrete random variables, under 
the assumption of a lower bound on the failure rate. Our results complement, but do not 
overlap with. Brown’s work. 

After stating our main theorem, a hrst application (approximating the equilibrium 
number of customers in an M/G/1 queueing system) will be presented to illustrate our 
bound. Further applications will be given in Sections [2] and [31 

In Section [2] we will consider the well-studied problem of geometric approximation 
for random variables with increasing failure rate. We will consider two straightforward 
applications of our Theorem 11.11 (to Poisson processes and the Polya distribution) which 
allow us to explicitly compare our bound with a similar result from [l^. In Section [3] we 
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consider compound geometric approximation for random variables with decreasing failure 
rate. In particular, we consider the number of customers served during a busy period of 
an M/G/1 queue, and the time to extinction of a discrete birth-death process. 

The proof of our Theorem II.II is given in Section 01 The proof uses Stein’s method (see 
211 and references therein), building upon previous work on Stein’s method for geometric 
1^ and compound geometric approximation. Finally, in Section 0] we give related 
results which we illustrate with short examples. 

For future use, for any 0 < p < 1 and positive, integer-valued random variable X with 
u = 1 — dTvi^{X), C{X + 1)), we dehne 


H,{X) = min h+ (1 -p)r{X > l).p h + ■ (l-l) 

We now state our main result. 


Theorem 1.1. Let W be a nonnegative, integer-valued random variable with P(1F = 
0) = p G (0,1) and rw{j) > <5 > 0 for all j. Let Y = where N Geom{p) and 

X,Xi,X 2 ,... are i.i.d. positive, integer-valued random variables. If 


EW > 


p 

(l-p)5’ 


then 

dTvmW),CiY)) < Hp{X) {EY - EW) . 

Note that under the conditions of this theorem, it is straightforward to show that 
EW < 5“^, so that the resulting upper bound is nonnegative, as expected. 


1.1 Application to the number of customers in an M/G/1 queue 

Consider an M/G/1 queueing system in equilibrium, with customers arriving at rate A and 
with i.i.d. service times having the same distribution as the random variable S. Letting 
p = APIS'], we assume throughout that p < 1. Let W be the number of customers in the 
system. It is well-known that 


E{W = 0) = 1 — p, and EW = p + 


p^E[S^] 


2(l-p)(E5)2- 

See, for example, page 281 of [ij. 

Let Rj denote the residual service time of the customer currently being served in the 
queue, conditional on the event {W = j}. Then Ross 2l| shows that 


rwU) = 


1 -p 


and ERj < sup E [S' — flF > f] 


XERj 


We may thus apply Theorem 11.11 with the choice 

1-p 


6 = 


A supjgR+ E [S' — t|S' > t] 
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The random variable S is said to be new better than used in expectation (NBUE) if we 
have that E [S' — tlS” > t] < MS' for all f > 0. In this case we may take 5 = p~^{l — p) 
and, in the notation of Theorem ll.il we may then take X = 1 a.s., so that Y simply has 
a geometric distribution and Hp{X) = p. We thus obtain the following. 

Corollary 1.2. Let W be the number of customers in an M/G/1 queueing system in 
equilibrium as above. If S is NB UE then 

dTv{C{W), Geom{l - p)) < p'^ (^1 - • 

Note that, as expected, this upper bound is zero if the service time S has an exponential 
distribution (which is indeed NBUE). 

Finally in this section, we refer the interested reader to the book by Muller and Stoyan 
(lij l. who prove many stochastic comparison and monotonicity results for queueing models 
(and in many other applications), and derive associated bounds on quantities such as the 
mean waiting time and mean busy period for stationary queues. Some of their work also 
takes advantage of reliability properties of the underlying random variables, as we have 
done here. 


2 Geometric approximation for IFR distributions 

In the notation of Theorem 11.11 since p = P(hU = 0) we have that rw{0) = p{l — p)~^. If 
the failure rate ^^(i) is increasing in j, this may clearly serve as the lower bound 6. In 
this case, we may let the random variable X be 1 almost surely, so that Y has a geometric 
distribution. Noting that Hp{X) = p in this case, we obtain the following. 

Corollary 2.1. Let W be a nonnegative, integer-valued random variable with P(1U = 
0) = p. IfW has increasing failure rate (IFR) then 

drvmW), Geom{p)) < I - p{l + EW). 

Note that we do not need the monotonicity of to obtain such a bound; it suffices 
to have rw{j) — ^w(O) for all j G . 

Geometric approximation theorems for IFR random variables are well-known. We 
use the remainder of this section to give two explicit examples in which we can compare 
our Corollary 12.11 to the main theorem of Obretenov [l^ . Obretenov does not use total 
variation distance but employs the Kolmogorov distance dx dehned by 

dxicm^^y)) = sup |p(iu< j)-ny < j)\ ■ 

jei.+ 

Since total variation distance is stronger than Kolmogorov distance. Corollary 12.11 also 
bounds the Kolmogorov distance between W and our geometric distribution, and thus 
Obretenov’s bound may be compared with ours. 

Obretenov [l^ shows that if hU is a nonnegative, integer-valued, IFR random variable 
with EW = p then 

d.(£W.Geom((l + ,n)<^(l- , (2,1) 
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Note that Obretenov chooses a geometric distribution having the same expectation as W, 
while we have chosen ours to have the same probability of being zero. 


2.1 Application to the Polya distribution 

Suppose m balls are distributed randomly among d > 2 urns, in such a way that all 
assignments are equally likely. Let W count the number of balls in the hrst urn. Then 
W ~ Pya(m, d) has a Polya distribution, with 


P(VP = k) 


/d-\-7n—k—2\ 

V m—k ) 

( d-\-m—l\ 

\ m J 


0 < k < m. 


It is straightforward to show that, with this definition 


P(iy = kf > = k- 1)P(IP = k + l), 

for 1 < fc < m. Hence, W is IFR. See, for example, page 177 of [l^. We may thus apply 
Corollary 12.11 which gives 

(2.2) 

where Y ~ Geom . In this case, Obretenov’s bound (12.1 p is 

dx (C{W),Geom (—^373 -7' (2-3) 

V \d + mj J - {d+l){d + m) ^ ' 

Our bound is better than (12.3p for d large enough (specihcally, + dm — 3d — m > 0). 
Note, however, that (12. 2 p bounds the total variation distance, while (12.3p bounds only 
the weaker Kolmogorov distance. Our (12.2p also improves upon bounds for geo metric 
approximation of the Polya distribution in Example 3.1 of and Section 4 of |2nj|. 

A simple lower bound corresponding to (12.2p is given by 

dTv(c(w)x(v)) > mw = 1) - p{y = 1)1 = —rii ■ (2.4) 

(a + m — 2)(a + m — 1)"' 

In the case where d is of order 0{m), this lower bound is of the same order as each of the 
upper bounds (12.21) and (12.31) . Some numerical comparison of the bounds (I2.2p - (I2.4I) is 
given in Table [TJ 


2.2 Application to Poisson processes 

Let {N(t) : f > 0} be a homogeneous Poisson process of rate A and let T be an IFR 
random variable independent of {N{t) : t > 0}. By Corollary 5.2 of 22(], N{T) is also 


IFR. Since F{N{T) = 0) = EA^(r) = AET, and Var(iV(T)) = AET + A2Var(T), 

we have from our Corollary 12.11 that 


drv {C{N{T)), Geom (Ee”^^)) < 1 - (Ee”^^) (1 + AET) 


(2.5) 
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Table 1: Geometric approximation for W ~ Pya(m,d). Values of p = (d—l)(d + m — l) ^ 
given to 4 d.p.; total variation distance between W and V ~ Geom(p), and bounds 
(O-dll given to 4 s.f. 


m 

d 

V 

dTv{C{W)X{Y)) 

(ra 

ra 


200 

200 

0.4987 

0.0009458 

0.002506 

0.004975 

0.0006281 

200 

10 

0.0431 

0.03055 

0.09569 

0.1732 

0.0001981 

10 

10 

0.4737 

0.02255 

0.05263 

0.09091 

0.01385 

10 

200 

0.9522 

0.0002190 

0.0002392 

0.0004738 

0.0002190 


while Obretenov’s result (12.Ih gives 


dK (£(iV(T)), Geom ((1 + AET)”^)) < 


AET 
1 + AET 


/ Er + AVar(r)A 
V ~ ET(1 + AET) J ■ 


( 2 . 6 ) 


To give an explicit example where we may compare these bounds, suppose that T ~ 
r(Q;, /3) has a gamma distribution with density function ^(x) proportional to for 

some a > 1 and /3 > 0. Then T is IFR. Since 


the bounds fl2.5p and fl2.6p become, respectively, 

dry ^>C(V(T)), Geom +-^ ^^<1-^1 + -^ ^1 + —^, (2.7) 

and 

<;.(£(iV(r)).Geo.n(^))<f;|^. (2.8) 

To compare 02.71) and 02.8p . we use Taylor’s theorem to note that for small A the 
upper bound of 02.7p is approximately equal to a(a — 1)A^ (2/3^) which is smaller than 
the upper bound of 02.8p whenever (V^ — l)/3 > otX. Finally, we again emphasise that 
02.7p bounds the total variation distance, while 02. 8 p bounds only the weaker Kolmogorov 
distance. 

We return to further applications of our results to Poisson processes in Section O 


3 Approximation for DFR distributions 

In this section we present some further applications of our main result, Theorem ll.il We 
will consider random variables which have the decreasing failure rate (DFR) property, 
so that the lower bound 5 may be taken to be lim^^oo ^w(i)- The applications we will 
consider will be to the number of customers served in a busy period of an M/G/1 queue. 
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and to the time to extinction of a discrete birth-death process. In each case we will 
construct the relevant random variable W as the time at which a particular Markov chain 
on Z~^ hrst visits the origin. In this case, Shanthikumar 2^ gives sufficient conditions for 
the DFR property to hold and an expression for the failure rate which will allow us to 
apply our Theorem 11.11 

Let {Zn ■ n > —1} he a discrete-time Markov chain with state space Z"*" and transition 
matrix P = (pij). Let the entries of the matrix P~^ = (p^) be given by pfj = J2T=jPik for 
i,j G Z+. Assume that the Markov chain starts at Z’_i = 1 and dehne the hitting time 


W = min{?7, > 0 : = 0} . 


(3.1) 


Without loss of generality in what follows, we may assume that the state 0 is absorbing. 
We have chosen to start our Markov chain at time —1 so that the support of W matches 
that of our compound geometric distributions. 

We say that the matrix is TP 2 if ptkPp ^ PtiP~jk for ah i < j and k < 1. Theorem 
3.1 of 2^ states that if P~^ is TP 2 then W is DFR. From the proof of that theorem, we 
also have that for such DFR hitting times W, ry/{j) > 6 for all j G Z'^, where 


6 = PiQ lim ^{Zj = i\Zj > 1) 

f ^ j^OO 


2=1 


(3.2) 


In order to evaluate this expression, we will therefore need an expression for the limiting 
distribution of our Markov chain conditional on non-absorption. 

To translate a lower bound on rw{j) irito a lower bound on rw{j), we will use the 
following lemma, whose proof is straightforward and therefore omitted. 

Lemma 3.1. Let W be a nonnegative, integer-valued random variable with rw{j) > d for 
all j G Z+. Then rw{j) > h ("l — h) for all j G Z+. 


3.1 Customers served during a busy period of an M/G/1 queue 


Consider the M/G/1 queue of Section [m with customers arriving at rate A and i.i.d. 
service times with the same distribution as S. Again letting p = AES', we will assume 
throughout that p < 1. We will also assume that S is IFR, so that, by Shanthikumar’s 
Theorem 5.1, the number of customers served during a busy period is DFR. 

Consider the embedded Markov chain : n > —1}, where Z_i = 1 and Zn represents 
the number of customers in the system after the departure of customer n (with customers 
labelled 0,1, 2,...). Then IF -|- 1, with the hitting time IF given by fl3.ll) . is the number 
of customers served during a busy period of the queue. 

This Markov chain has the transition probabilities poo = 1 and p^ = g{j -f 1 — z), 
where 


„(k) = i P it > 0 . 

1 0 if*:<0. 


Hence, 


p = P(1F = 0) = pio = Ee 


-AS 


(3.3) 
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We also have that EhF = p(l — p) h See, for example, page 217 of [131. 

Since Pio = 0 for i > 1, the lower bound 6 given by fl3.2p becomes 6 = p9, where 
6 = hm,-^noP (Zj = l\Zj > 1). To hnd an expression for 6 we use a formula due to 
Kyprianou [l^. Suppose the density of the service time S has Laplace transform p, and 
let ^ be the real solution of 1 + Xp'{s) = 0 nearest the origin. By a result on page 829 of 
1^, we then have that 


e - A + A(^(o 
(e - A)(^(A) 


(3.4) 


Using Lemma 13.11 we may then take the lower bound 5 = p6{l — p9) ^ in Theorem 11.11 
and we obtain the following. 


Theorem 3.2. Let W + 1 be the number of customers served in a busy period of an 
M/G/1 queue with arrival rate A and service time S. Let p and 9 be given by hS. 3\) and 
o, respectively. Suppose that S is IFR and that p = AES' < 1. Let N ~ Geom{p) and 
Y = where X, W, -^ 2 , • • • i.i.d. with (1 — p)9'EX > 1 — p9. Then 

dTvmW),C{Y)) < Hp{X) ^ 

V p 1 - p 



The number of customers served in a busy period of this queueing system is closely 
related to the total progeny of a certain branching process, and so our Theorem 13.21 may 
also be applied in that setting. If we dehne the offspring of a customer to be the other 
customers who arrive while he is being served, the number of customers served during a 
busy period has the same distribution as the total progeny of the customer initiating the 
busy period. See page 284 of [H for further details. 

To illustrate our Theorem 13.2( we consider the example where S ~ r{k,f5) has an 
Erlang distribution for some integer k > 1 and some /3 > 0. In this case, S is indeed IFR. 
Since ES” = k(d~^, our condition on p requires that kX < /?. 

Using (13.311 . p = (1 + X(3~^) . The Erlang density has Laplace transform ip{s) = 

I3^{s + /9)“^, and since kX < j3 li is straightforward to check that f — j3 

and that therefore, by (13.4p . 
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where A = (/3 + A) ( ^ ) “ ^ • 


Theorem 13.21 thus requires that we choose 


EX > 


{A-im+xf 


/5^)' 


(3.5) 


If we choose X such that equality holds in (I3.5p . the upper bound of Theorem |32] becomes 
Hp{X)U < U, where 


Some numerical illustration of this bound is given in Table O 


(3.6) 
























Table 2: Some values of the upper bound U of (13.61) for the Erlang service time example. 
Invalid parameter choices are marked with 


k 

A 



/5 



0.1 

0.5 

1 

1.5 

10 


0.001 

0.1134 

0.0470 

0.0327 

0.0265 

0.0101 


0.005 

0.3183 

0.1134 

0.0769 

0.0617 

0.0229 

1 

0.01 

0.5652 

0.1714 

0.1134 

0.0901 

0.0327 


0.05 

> 1 

0.5652 

0.3183 

0.2388 

0.0769 


0.1 

- 

> 1 

0.5652 

0.3978 

0.1134 


0.001 

0.3784 

0.1985 

0.1588 

0.1406 

0.0846 


0.005 

> 1 

0.3783 

0.2781 

0.2378 

0.1294 

5 

0.01 

> 1 

0.5619 

0.3784 

0.3137 

0.1588 


0.05 

- 

> 1 

> 1 

0.8366 

0.2781 


0.1 

- 

- 

> 1 

> 1 

0.3784 


0.001 

0.5777 

0.2827 

0.2271 

0.2025 

0.1282 


0.005 

> 1 

0.5777 

0.4027 

0.3402 

0.1874 

10 

0.01 

~ 

0.9890 

0.5777 

0.4614 

0.2272 


0.05 

~ 

- 

> 1 

> 1 

0.4027 


0.1 

- 

- 

- 

> 1 

0.5777 


3.2 Time to extinction of a birth-death process 

We let {Zn : n > —1} be the Markov chain with Z_i = 1, poo = and 

if j = z + 1, 
if j = z - 1, 
if j = z, 
otherwise, 

for z > 1. Let W be the hitting time dehned by (13.ip : the time when this discrete 
birth-death process becomes extinct. 

Clearly we have that p = ¥{W = 0) = and, from (13.21) . 

5 = qi lim ^{Zj = l|Zj > 1) . (3-7) 

j^oo 

To hnd an expression for this limit, we use the famous Karlin-McGregor [l^ representa¬ 
tion of the rz-step transition probabilities of this chain. Dehne tti = 1 and 

Pi-P2---Pj-i 

-Kj = - , 

^2 W3 • • ■ Qj 

for j > 2. Then Karlin and McGregor [l^ show that there is a unique positive measure 
■0, of total mass 1, supported on [—1,1] such that 

Pij{n) =¥{Zn = j\Zo = i) = TTj J x"'Qi{x)Qj{x)d^jJ{x), ioTi,j>l, 
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where {Qj : j > 1} is a sequence of polynomials (orthogonal with respect to "0) satisfying 
the relations Qi{x) = 1, piQ 2 {x) = x — ri, and 


xQj{x) = qjQj-i{x) +rjQj{x) +pjQj+i{x), 


for j > 2. Following the notation of van Doom and Schrijner [^, Qj+i has j distinct 
zeros which we denote xy < X 2 j < ■ ■ ■ < Xjj. We write p = limfc_j.oo Xkk and 




/°i(-a:)^d0(x) 


In what follows we make the following assumptions: 


OO 

= CX), (3.8) 

k=l 

h < 1, (3.9) 

lim Cnii/j) = 0, (3.10) 

n^oo 

fj > - for all j > I ■ (3.11) 


Assumption (13. 8 p guarantees that the birth-death process does eventually reach extinc¬ 
tion: see Section 4 of [25]. Assumptions (13.Op and (I3.10p are used to ensure that the limit 


fl3.7p exists, and are taken from Lemma 4.1 of 2^. Finally, assumption fl3.1ip is sufficient 
to guarantee that the transition matrix of our birth-death chain is and hence that 


the extinction time W is DFR. See page 6 of [lOj and Remark 3.2 of [24 


We note that Section 3 of 2^ gives several conditions under which the assumption 


(I3.10p holds and which may be used to check its validity in practice. 

Under the assumptions fl3.8ll - fl3.lll) . Lemma 4.1 of gives us that 6 = 1 — p, and 
so (by Lemma [3.ip we may take 6 = p~^{l — p) in Theorem ll.il Applying that result, we 
then obtain the following. 


Theorem 3.3. Let W be the time to extinction of the discrete birth-death process de¬ 
fined above. Assume H3.8\) - I[37T1\) hold. Let N ~ Geom{qi) and Y = where 

X, Xi, X 2 ,... are i.i.d. with (1 — ^^(l — p)¥^X > qip. Then 

dTv{C{W),C{Y)) < H,fiX) . 

Finally, note that Brown considers exponential approximation for hitting times of 
birth-death processes in continuous time, taking advantage of monotonicity of the failure 
rate in his work. See also the references within Brown’s work. 


4 Proof of Theorem 11.11 

We use this section to give the proof of our main result. Theorem 11.11 The proof is 
based on Stein’s method for compound geometric approximation. Stein’s method was 
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first applied to the problem of approximation by a geometric distribution by Barbour and 
Griibel and Pekoz E- , More recent developments in Stein’s method for geometric 
approximation are given in [3 and [l^. Stein’s method has previously been used in the 
compound geometric case in |9|, and compound geometric distributions have appeared in 
conjunction with Stein’s method in papers by Bon j^, Pekoz and Rollin [l^, and Pekoz, 
Rollin and Ross 191. The interested reader is also referred to and references therein 
for an introduction to Stein’s method more generally. 

Throughout this section we will let W and Y be as dehned in Theorem ll.il and Hp{X) 
be given by fll.ip . We dehne the random variable V to be such that V+X =st (1R|1P > 0), 
where =st denotes equality in distribution. 

We will employ the usual stochastic ordering in what follows. Recall that for any two 
random variables T and U, T is said to be stochastically smaller than U (written T < st U) 
if P(T > j) < ¥{U > j) for all j. 

Lemma 4.1. Let W he a nonnegative, integer-valued random variable with P(hP = 0) = p 
and rw{j) > 6 > 0 for all j & lY. Let V be as above and suppose that EX > 
ThenV + X <stW + X. 

Proof. From the dehnition of V, the required stochastic ordering will follow if 


(1 - p)¥{W + X >j)> P(VF > j ), 


(4.1) 


for all j e Z+. 

Conditioning on X (which is independent of W), we write 


F{W + X >j)= F{W > j) + E 


P(W = k) 

.k=j+l-X 


Using this, we rearrange fl4.ip to obtain that the required stochastic ordering holds if 

3 


¥{W > j) 

Now, if rw{k) > S for all k then 
1 


E 


Y = k) 

,k=j+l-X 


> 


p 


1 — p 


(4.2) 


F{W > j) 


E 


Y F{W = k) 

lk=j+l-X 


> E 


Y 

,k=j+l-X 


> 5EX. 


□ 


Hence, if EX > then fl4.2p holds and our lemma follows. 

The proof of Theorem 11.11 then goes along similar lines to that of Proposition 3.1 in 
For A C Z+ we let : Z+ i—)■ M be such that /yi(0) = 0 and 

/(j e H) - P(F e H) = (1 - p)FfA{j + X) - U{j) , (4.3) 

/(•) denoting an indicator function. We then note the following property of /^. 
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Lemma 4.2. Let Ja be as above. Then sup^g^+ |/^(j + 1) - fA{j) \ < lHp{X). 

Proof. From the definition 04.31) . it is easy to check that 

CX) 

fA{3) = -Y,^i-py[n3+x^ + ---+x,^A)-ny+x^ + ---+x,^A)] , (4.4) 

i=0 

from which it follows that |/A(i + 1) — /yi(i)| may be bounded by 

OO 

~ pY |lP(i + 1 + Xi + ■ ■ ■ + Xi ^ A) — P(j + Xi + ■ ■ ■ + Xi ^ A)\ . (4.5) 

i=0 

To complete the proof, we bound 04.51) in two different ways. Firstly, letting N ~ Geom(p), 
we write this as 

-|P(j + 1 + Xi + ■ ■ • + Xjv G A) — P(j + Xi + • • • + Xjq G v4)| 

P 

< ^TviCiY), CiY + !))<! + ii^P(X > 1), (4.6) 

p p 

where the final inequality uses Theorem 3.1 of j^. Alternatively, we have that 

|IP(j + 1 + + ■ ■ • + G A) — P(j + Xi + ■ • • + Xj G A) I 

< dry(/l(Xi + ■ ■ • + Xj), C{Xi + ■ ■ ■ + Xj + 1)). 

We may then follow the analysis of Theorem 3.1 of [l^ to obtain 

UaU +1) - n(i)i < 1 + . 

where u = 1 — dTv{tC-{X), £(X + 1)). This completes the proof. □ 

Remark 4.3. Lemma 14.21 improves upon part of Theorem 2.1 of [^, by presenting a 
sharper bound and removing a restriction on the support of X. This may, in turn, be 
used to improve on Proposition 3.1 of [^. For a general X, Lemma [4.21 gives the bound 
|/^(j + 1) — fA{j)\ < P~^ (which is the same bound given in j^), but also shows that a 
better bound is possible when P(X = 1) is large (informally, when Y is close to a geometric 
distribution) or when X is smooth (in the sense that the total variation distance between 
X and X + 1 is small). Note that the bound \fA{j + 1) — IaUY < P~^ is the best 
possible without imposing restrictions on X. To see this, suppose that X = 2 a.s. and let 
A = 2Z+. In this case, fl4.4p gives /^(j + 1) — /^(j) = (—The inequalities fl4.6l) 
are also sharp, in that if X ~ Geom(p) then dry(£(y), C{Y + 1)) = p. 

Using the definitions of Ja and V, we may write 
P(1U G A) - P(y G A) = (1 - p)E [fA{W + X) - fA{V + X)] 

OO 

= (1 -P) E + 1) - + X>j)-F{V + X> j)] . (4.7) 

j=0 
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Now, under the conditions of Theorem II. 11 Lemma [4.11 gives us that V + X <st W + X. 
Hence, bounding fl4.7l) using Lemma [4.21 gives 

|P(VL e H) - P(y e H)| < ^^~^\ p{X)E [VL - H] = Hp{X)E [y - VL] , 

where the hnal equality follows from the dehnition of V. We have thus established The¬ 
orem o 

Remark 4.4. The techniques of this section may also be used to bound the Wasserstein 
distance dw{C,{W), C{Y)) = sup^j \Eh{W) — E/i(y)|, where the supremum is taken over 
all 1-Lipschitz functions h. Under the conditions of Theorem 11.11 we follow the above 
methods to obtain the bound dvv(-^(hU),'^(y)) < py — EW. This bound is sharp. 
Suppose, for example, that W is IFR and Y ~ Geom(p). Then W <g± Y (j^. Theorem 
l.B.l]) and so dwmW), C{Y)) = EY- EW. See Theorem l.A.ll of 0. 


5 Some further results 

In this section we note two results, closely related to Theorem ll.il which may prove useful 
in applications. Potential applications are indicated for each via short examples. 


5.1 Approximation for translated distributions 

Let W be as in Theorem 11.11 and let Wm =st iW ~ 'm\W > m) for some m G Z+. In 
many cases it is more natural to seek a compound geometric approximation for (for 
some m > 1) than for W. This may be achieved in a straightforward way using Theorem 
11.11 We note that the failure rate of Wm may be bounded from below by 


5m = mmrwii) > 5 , 

j>m 


(5.1) 


and that if W has monotone failure rate, Wm inherits this property. Letting 

r[W > m) 

we may apply Theorem 11.11 to Wm to obtain the following corollary. 


(5.2) 


Corollary 5.1. Let W be a nonnegative, integer-valued random variable, m G Z"*", and 
Wm =st iW — m\W > m). Let 5m and Pm be given by H5.1\) and respectively. Let 

Y = Ya=i where N Geom{pm) and X, Xi, X 2 ,... are i.i.d. with 


EA > 


Pn 


Then 


(1 Pm) 5 m 

dTv{C{Wm),C{Y)) < HpjX) (Ey - EW^) . 
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To illustrate one situation in which such a result would be useful, consider the Markov 
chain {Z„ : n > — 1 } with state space { 0 , 1 , 2 }, Z_i = 2 , and transition matrix 

/ 1 0 0 \ 

I tti ei , 

V «2 ^2 £2 / 

where, for i G {1,2}, we have that Oj, f3i, G (0,1) with ctj + = 1 and where we 

consider to be small. 

Letting W be the hitting time W = min{? 7 , > 0 : = 0}, the most natural geometric- 

type approximation in this setting is to approximate Wi by a geometric distribution with 
parameter close to ai. We will show that this is easily achieved using Corollary 15.11 
Elementary calculations show that, since P(hT = 0) = a 2 , 


Pi 


-L-f- 

1-02 


EVE 
1 - a ;2 


— 1 , and EhE 


(1 ~ (^l)^2 + /52(1 + ^i) 
(1 - /3i)(l - 62 ) - /52ei ■ 


For simplicity in what follows, we will assume that ai > a 2 and that / 3 i (/92 -|- € 2 ) > 
P 2 W 1 +Li)- These conditions are sufficient to guarantee that W is IFR (using Theorem 
4.1 of |22|). In this case we may take X = 1 a.s. in Corollary 15.11 as we did in the IFR 
examples we have previously considered. 

Corollary 15.11 then gives us the bound dTvi'^iWi), Geom(pi)) < A + B + C, where 


A 


®l /^2 + CI2G2 

Q;2(1 ~ 0^2) + ~ ®2)(1 — 0!2 — €2) 


B 


(ai/32 + a2e2)(l -|- ei — 62 ) 
(1 ~ cr 2 )(®i ~ + ^^ 2 )) 


and 


^ _ 62(01 - a 2 ){ai /32 + €12^2) 

(1 - a 2 )^(a 2 ei + ai(l - ^ 2 )) ’ 

We conclude this illustration by noting that if either ei = 62 = 0 or oi 
upper bound is zero, as expected. 


02 then our 


5.2 Hazard rate ordering 


In this section we will need the hazard rate ordering. For two nonnegative random vari¬ 
ables T and U, T is said to be smaller than U in the hazard rate order (denoted T U) 
if ^r(j) > 'fuU) for all j. See, for example. Section l.B of [23 . 

In proving Theorem ll.il Lemma [4.11 gave conditions under which V + X <st W + X, 
which then allowed us to deduce a compound geometric approximation bound. In the 
case of geometric approximation, we use the hazard rate order to express conditions 
under which this stochastic ordering holds. 

If X = 1 a.s. (so we are in the geometric approximation case, and Hp{X) = p), then 
fl4.2p tells us that V + 1 <st VE -|- 1 if 


rwU) > 


P 


P 


= ^nU) , 


where p = P(hE = 0) and N ~ Geom(p). That is, if W <hr N then R -|- 1 <st VE -l- 1 and 
the bound of Theorem 1 1.1 1 holds with X = 1 almost surely. A similar argument shows that 


14 




















if N <hr W then W + 1 <st V + 1 and we obtain an analogous geometric approximation 
result. In fact, we have the following. 

Theorem 5.2. Let W be a nonnegative, integer-valued random variable with p = P(hh = 
0). Let N ~ Geom{p) and suppose that either W <hr N or N <hr W. Then 


dTvmW),C{N)) < |l-p(l + ElT)| . 


To illustrate this result, we return to the setting of Section and let {N{t) : t > 0} 
be a Poisson process of rate A and T be a nonnegative random variable independent of 
{N{t) : t >0}. We have the following. 

Corollary 5.3. Let{N{t) : t > 0} andT be as above. Letp = and p = Xp{l—p)~^. 

Let rj ~ Exp{p) have an exponential distribution with mean and suppose that either 
T <hr p or rj <hr T. Then dTv{C{N{T)), Geom{p)) < Xp\p-^-ET\. 


Proof. We note that p = ¥{N{T) = 0), EiV(T) = AET, and that N(p) ~ Geom(p). The 
bound follows from Theorem 15.21 if either N{T) <hr N{p) or N{p) <hr NjT). 

Consider hrst the inequality N{T) <hr N{p). Using Theorem 1.B.14 of [23|, this holds 
if T <hr P- To see this, we need to verify that if ~ Po(a) has a Poisson distribution 
with mean a then <fir Z^ whenever a < ft. This is most easily checked by noting that 


•^OL 


= j)¥{Za = i)~^ is increasing in j, and then using Theorem l.C.l of 
required hazard rate ordering. 

Similarly, if p <hr T then N{p) <hr N(T) and the stated upper bound holds. 


23j to get the 


□ 
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